[SPARK-37452][SQL][3.1] Char and Varchar break backward compatibility between v3.1 and v2 #34736
Closed
yaooqinn wants to merge 2 commits into apache:branch-3.1 from yaooqinn/PR_TOOL_PICK_PR_34697_BRANCH-3.1
Conversation
cloud-fan approved these changes on Nov 29, 2021
Kubernetes integration test starting
Kubernetes integration test status failure
Test build #145708 has finished for PR 34736 at commit
dongjoon-hyun pushed a commit that referenced this pull request on Nov 30, 2021
… between v3.1 and v2

This backports #34697 to 3.1

### What changes were proposed in this pull request?

We store the table schema in table properties so that the read side can restore it. Spark 3.1 added native char/varchar support. In commands like `create table` and `alter table` with these types, the `char(x)` or `varchar(x)` type is stored directly in those properties. If a user then reads such a table with Spark 2, it fails to parse the schema. FYI, https://github.com/apache/spark/blob/branch-2.4/sql/catalyst/src/main/scala/org/apache/spark/sql/types/DataType.scala#L136

Such a table can be one newly created by Spark 3.1 or later, or an existing one modified by Spark 3.1 and on.

### Why are the changes needed?

Backward compatibility.

### Does this PR introduce _any_ user-facing change?

Not directly; this is a bug fix that only touches internal table properties.

### How was this patch tested?

Manually.

Closes #34736 from yaooqinn/PR_TOOL_PICK_PR_34697_BRANCH-3.1.

Authored-by: Kent Yao <yao@apache.org>
Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>
fishcus pushed a commit to fishcus/spark that referenced this pull request on Jan 12, 2022 (same backport commit message as above)
This backports #34697 to 3.1
What changes were proposed in this pull request?
We store the table schema in table properties so that the read side can restore it. Spark 3.1 added native char/varchar support. In commands like `create table` and `alter table` with these types, the `char(x)` or `varchar(x)` type is stored directly in those properties. If a user then reads such a table with Spark 2, it fails to parse the schema.
FYI, https://github.com/apache/spark/blob/branch-2.4/sql/catalyst/src/main/scala/org/apache/spark/sql/types/DataType.scala#L136
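For illustration, a hedged reproduction sketch (the `char_tbl` table and its columns are invented, and `enableHiveSupport()` assumes a Hive metastore that a Spark 2.x deployment also reads): creating a table with these types from a Spark 3.1 session writes the raw type names into the catalog's schema properties.

```scala
import org.apache.spark.sql.SparkSession

// Hypothetical reproduction: a Spark 3.1 session backed by a Hive metastore
// that a Spark 2.x deployment also reads. Table and column names are made up.
val spark = SparkSession.builder()
  .appName("char-varchar-compat-repro")
  .enableHiveSupport()
  .getOrCreate()

// Spark 3.1 serializes the schema, including char(10)/varchar(20), into the
// table properties that Spark 2.x later reads back to restore the schema.
spark.sql("CREATE TABLE char_tbl (c CHAR(10), v VARCHAR(20)) USING parquet")
```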
Such a table can be one newly created by Spark 3.1 or later, or an existing one modified by Spark 3.1 and on.
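And a minimal sketch of the read-side failure, assuming the stored schema JSON carries the raw `char(10)` type name (the JSON below is illustrative, not dumped from a real table): Spark 3.1+ parses it back because char/varchar are recognized natively, while Spark 2.x's `DataType.fromJson` rejects the unknown type name.

```scala
import org.apache.spark.sql.types.DataType

// Illustrative schema JSON of the shape Spark 3.1 may store in table properties.
val json =
  """{"type":"struct","fields":[
    |  {"name":"c","type":"char(10)","nullable":true,"metadata":{}}
    |]}""".stripMargin

// Parses on Spark 3.1+ (char is a recognized type name); on Spark 2.x this
// throws an IllegalArgumentException because "char(10)" is unknown there.
val schema = DataType.fromJson(json)
println(schema.simpleString)
```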
Why are the changes needed?
Backward compatibility: tables created or altered with char/varchar types in Spark 3.1 should remain readable by Spark 2.
Does this PR introduce any user-facing change?
Not directly; this is a bug fix that only touches internal table properties.
How was this patch tested?
Manually.